voice mode
Anthropic's Claude AI reportedly getting two-way voice mode soon
According to Bloomberg, AI startup Anthropic is about to release a voice mode for Claude. Currently, it's only possible to communicate with the Claude AI assistant via text, and adding voice mode would bring it up to parity with other advanced AIs like ChatGPT, Gemini, and Sesame. Claude's voice mode will initially only be available in English, with three different voice modes named "Airy," "Mellow," and "Buttery." According to Bloomberg, Claude's voice mode could be released as early as April, but it will initially be rolled out to a limited number of users. Anthropic hasn't yet commented on Claude's voice mode.
If I Don't Use AI, Will My Grandkids Still Think I'm Cool?
As a retiree, I want to stay close to my grandkids. I worry that not learning how to use AI will leave me behind. What's the easiest tool for me to learn, and should I be worried? I promise that you do not need to learn how to use a generative AI tool like ChatGPT or Claude to ensure your grandkids see you as a relevant, informed person. If anything, I would say that our culture has tipped over the past year to generally oppose the use of generative AI tools due to their outsize environmental impact, ethical concerns over their data scraping, and general sludginess of the outputs.
How to use AI to get a job interview and nail it โ along with the salary you deserve
The fear that artificial intelligence (AI) will replace millions of jobs is widespread. But equally, in today's tough job market, not using AI wisely as part of your search could mean you miss out. You can use AI models such as ChatGPT and Perplexity to research employers, competitors and industry trends before applying for a job. Hannah Salton, a careers coach, says some of her clients have successfully used AI to find out more about companies, allowing them to "gain insights into culture, competitors and market positioning. It can also help identify SMEs [small and medium-sized enterprises] to apply to or network with."
Tips for ChatGPT's Voice Mode? Best AI Uses for Retirees? Our Expert Answers Your Questions
Thank you so much to all the readers who tuned in live to participate in the second installment of our question and answer series focused on artificial intelligence. I was thrilled to see so many questions come in before the event, as well as all the questions that were dropped into the chat during our conversation. Below is a replay of this event that WIRED subscribers can watch whenever. Also, the livestream from the first one is available here. I started off the chat with a couple quick demos showing how to use the image and voice features built into chatbots, including an example of how it's possible to interact with ChatGPT's Advanced Voice Mode as a kind of Duolingo-style language learning tool.
OpenAI rolls out advanced Voice Mode and no, it won't sound like ScarJo
OpenAI has started rolling out its advanced Voice Mode feature. Starting today, a small number of paying ChatGPT users will be able to have a tete-a-tete with the AI chatbot. All ChatGPT Plus members should receive access to the expanded toolset by the fall of this year. In an announcement on X, the company said this advanced version of its Voice Mode "offers more natural, real-time conversations, allows you to interrupt anytime, and senses and responds to your emotions." We're starting to roll out advanced Voice Mode to a small group of ChatGPT Plus users.
OpenAI has delayed its seductive ChatGPT voice assistants
If you've been dreaming about spending your summer whispering sweet nothings into the digital ears of one of the seductive ChatGPT voice assistants that OpenAI showed off last month, you'll have to dream a little longer. On Tuesday, the company announced that its "advanced Voice Mode" feature needs more time in the oven "to reach our bar to launch." The feature will be available to a small group of users to gather feedback, and then launch to all paying ChatGPT customers in the fall. "We're improving the model's ability to detect and refuse certain content," OpenAI posted on X. "We're also working on improving the user experience and preparing our infrastructure to scale to millions while maintaining real-time responses." We're sharing an update on the advanced Voice Mode we demoed during our Spring Update, which we remain very excited about: We had planned to start rolling this out in alpha to a small group of ChatGPT Plus users in late June, but need one more month to reach our bar to launch.โฆ Voices have been a part of ChatGPT since 2023.
Voice Jailbreak Attacks Against GPT-4o
Shen, Xinyue, Wu, Yixin, Backes, Michael, Zhang, Yang
Recently, the concept of artificial assistants has evolved from science fiction into real-world applications. GPT-4o, the newest multimodal large language model (MLLM) across audio, vision, and text, has further blurred the line between fiction and reality by enabling more natural human-computer interactions. However, the advent of GPT-4o's voice mode may also introduce a new attack surface. In this paper, we present the first systematic measurement of jailbreak attacks against the voice mode of GPT-4o. We show that GPT-4o demonstrates good resistance to forbidden questions and text jailbreak prompts when directly transferring them to voice mode. This resistance is primarily due to GPT-4o's internal safeguards and the difficulty of adapting text jailbreak prompts to voice mode. Inspired by GPT-4o's human-like behaviors, we propose VoiceJailbreak, a novel voice jailbreak attack that humanizes GPT-4o and attempts to persuade it through fictional storytelling (setting, character, and plot). VoiceJailbreak is capable of generating simple, audible, yet effective jailbreak prompts, which significantly increases the average attack success rate (ASR) from 0.033 to 0.778 in six forbidden scenarios. We also conduct extensive experiments to explore the impacts of interaction steps, key elements of fictional writing, and different languages on VoiceJailbreak's effectiveness and further enhance the attack performance with advanced fictional writing techniques. We hope our study can assist the research community in building more secure and well-regulated MLLMs.
Pandora's new voice search feature knows what you want to hear
It's been almost two years since Pandora launched its on-demand music streaming service. In that time, the company has done a solid job of fixing some of the issues that cropped up at launch and even adding some features the competition hasn't got to yet (like downloading songs to an Apple Watch for offline playback). Today, Pandora's adding another feature that some of its competitors have: Voice Mode. But, as usual, Pandora believes that the amount of information it has on both the music in its catalog as well as its users will set its voice features apart. For starters, Pandora built Voice Mode internally, from the ground up, something Chief Product Officer Chris Phillips says was key in Voice Mode being a more personal music assistant.